Bias in sensitivity and specificity caused by data-driven selection of optimal cutoff values: mechanisms, magnitude, and solutions.
Abstract
BACKGROUND: Optimal cutoff values for test results involving continuous variables are often derived in a data-driven way. This approach, however, may lead to overly optimistic measures of diagnostic accuracy. We evaluated the magnitude of the bias in sensitivity and specificity associated with data-driven selection of cutoff values and examined potential solutions to reduce this bias.
METHODS: Different sample sizes, distributions, and prevalences were used in a simulation study. We compared data-driven estimates of accuracy based on the Youden index with the true values and calculated the median bias. Three alternative approaches (assuming a specific distribution, leave-one-out, smoothed ROC curve) were examined for their ability to reduce this bias.
RESULTS: The magnitude of bias caused by data-driven optimization of cutoff values was inversely related to sample size. If the true values for sensitivity and specificity are both 84%, the estimates in studies with a sample size of 40 will be approximately 90%. If the sample size increases to 200, the estimates will be 86%. The distribution of the test results had little impact on the amount of bias when sample size was held constant. More robust methods of optimizing cutoff values were less prone to bias, but their performance deteriorated if the underlying assumptions were not met.
CONCLUSIONS: Data-driven selection of the optimal cutoff value can lead to overly optimistic estimates of sensitivity and specificity, especially in small studies. Alternative methods can reduce this bias, but finding robust estimates for cutoff values and accuracy requires considerable sample sizes.
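The mechanism described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' code: it assumes normally distributed test results with equal variance in both groups, a prevalence of 50%, and a group separation chosen so that the true sensitivity and specificity at the best possible cutoff are both 84%. The function names `youden_optimal` and `simulate` are hypothetical. For each simulated study the cutoff that maximizes the Youden index (sensitivity + specificity - 1) is selected from the sample, and the apparent accuracy at that cutoff is recorded.

```python
import random
from statistics import NormalDist, mean, median


def youden_optimal(diseased, healthy):
    """Return (cutoff, sensitivity, specificity) maximizing the Youden index.

    A result counts as positive when value >= cutoff; every observed value
    is tried as a candidate cutoff.
    """
    labelled = sorted([(x, 1) for x in diseased] + [(x, 0) for x in healthy])
    n_d, n_h = len(diseased), len(healthy)
    fn = tn = 0  # diseased / healthy values lying below the current cutoff
    best = (-1.0, None, None, None)
    for value, is_diseased in labelled:
        sens = (n_d - fn) / n_d
        spec = tn / n_h
        j = sens + spec - 1
        if j > best[0]:
            best = (j, value, sens, spec)
        # Moving the cutoff above this value turns it into a negative result.
        if is_diseased:
            fn += 1
        else:
            tn += 1
    return best[1], best[2], best[3]


def simulate(total_n, reps=2000, true_accuracy=0.84, seed=1):
    """Summarize apparent accuracy at the data-driven cutoff over many studies.

    Assumptions (illustrative only): binormal data with unit variance,
    prevalence 50%, true sensitivity = specificity = true_accuracy at the
    midpoint cutoff.
    """
    rng = random.Random(seed)
    separation = 2 * NormalDist().inv_cdf(true_accuracy)
    n_group = total_n // 2
    sens_values, spec_values = [], []
    for _ in range(reps):
        healthy = [rng.gauss(0.0, 1.0) for _ in range(n_group)]
        diseased = [rng.gauss(separation, 1.0) for _ in range(n_group)]
        _, sens, spec = youden_optimal(diseased, healthy)
        sens_values.append(sens)
        spec_values.append(spec)
    return {
        "median_sens": median(sens_values),
        "median_spec": median(spec_values),
        "mean_youden": mean(s + p - 1 for s, p in zip(sens_values, spec_values)),
    }


if __name__ == "__main__":
    for n in (40, 200):
        print(n, simulate(n))
```

Under these assumptions the apparent Youden index should exceed the true value of 0.68 for both sample sizes, with the larger excess at the smaller sample size. This is the direction of bias the abstract reports; the exact figures depend on the assumed distributions and on the random seed.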
Similar articles
SURVEY ON THE SENSITIVITY AND SPECIFICITY OF BETA ANGLE IN DIFFERENTIATING SKELETAL CLASS II AND III MALOCCLUSIONS FROM CLASS I
Background & Aims: A successful treatment in the field of medical sciences depends on an accurate diagnosis. In orthodontic diagnosis and treatment planning also analyzing the sagittal jaw base relationship is important. Various methods have been suggested for this. This study aimed to investigate the accuracy of beta angle in sagittal jaw base relationship diagnosis. Materials & Methods: In t...
Evaluation of the Clinical Risk Index for Babies (CRIB) in predicting the survival of very low birth weight infants
Objective: Very low birth weight (VLBW) babies constitute approximately 4%-7% of all live births, and mortality in this subgroup is high, contributing to as much as 30% of early neonatal deaths. Scoring systems such as the Clinical Risk Index for Babies (CRIB) score and the Score for Neonatal Acute Physiology (SNAP) are frequently used to assess the risk of mortality in newborns....
A novel method for reducing Rician noise in the magnitude of the diffusion signal in magnetic resonance imaging (MRI)
The true MR signal intensity extracted from noisy MR magnitude images is biased with the Rician noise caused by noise rectification in the magnitude calculation for low intensity pixels. This noise is more problematic when a quantitative analysis is performed based on the magnitude images with low SNR(<3.0). In such cases, the received signal for both the real and imaginary components will fluc...
Sensitivity and Specificity of CA 15-3 in Detection of Breast Cancer Recurrence
Background: The value of the CA 15-3 (cancer antigen 15-3) marker in early detection of breast cancer recurrence has been studied in several prospective trials, but the results of these studies differ. This may be due to the variable cutoff points used for analysis, different intervals between CA 15-3 measurements, and differences between patient populations. This study was done to examine the p...
Investigation of the Sensitivity and Specificity of the Persian Version of the New Multidimensional Depression Scale in Diagnosing Depressive Disorder
Objectives: The accuracy of diagnosis in mental disorders, such as depression is the basis of correct treatment. The present study aimed to investigate the sensitivity and specificity of the new multidimensional depression scale in diagnosing depressive disorder. Methods: Two groups of participants were assessed by the new multidimensional depression scale (NMDS) and structured clinical inter...
Journal:
- Clinical Chemistry
Volume 54, Issue 4
Pages -
Publication date: 2008